ECPEC029 3895 bp DNA circular BCT 06-OCT-1997

Escherichia coli plasmid pEC029 DNA, complete sequence. 
AJ001708 

AJ001708.1 GI:2660511 

complete sequence; DNA methyltransferase; eco29kIM gene;
gene; plasmid; restriction endonuclease.
Escherichia coli. 
Plasmid Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
Escherichia.

1 (bases 1 to 3895) 
Solonin,A.S.
Direct Submission 

Submitted (06-OCT-1997) Solonin A.S., Molecular Biology, Institute
of Biochemistry and Physiology of Microorganisms, IBPM RAS, 
Pushchino, Prospekt Nauki, 5, Moscow Region, 142292, RUSSIA 

2 (bases 1 to 3895) 

Pertzev,A.V., Ruban,N.M., Zakharova,M.V., Beletzkaja,I.V.,
Petrov,S.I., Kravetz,A.N. and Solonin,A.S.

Eco29kI, a novel plasmid encoded restriction endonuclease from 
Escherichia coli 

Nucleic Acids Res. 20 (8), 1991 (1992) 
92253423 

Location/Qualifiers 
1..3895

/organism="Escherichia coli" 

/plasmid="pEC029" 

/strain="29k" 

/db_xref="taxon:562"

390..1034

/gene="eco29kIR" 

390..1034

/gene="eco29kIR"

/EC_number="3.1.21.4"

/function="DNA restriction"

/codon_start=1

/transl_table=11

/evidence=experimental

/product="restriction endonuclease"

/protein_id="CAA04943.1"

/db_xref="GI:2660512"

/db_xref="SPTREMBL:Q46944"

/translation="MHNKKFDRSEHVYRNDSFLELIKDAVRFFSGTPVHSLPPPERFQ 

GAGVYALYYTGHYSLYDEYSRINRLAYNLPIYVGKAVPAGWRQSRISDHETRAGSELS 

NRIREHGRNIAKTSNLDLCDFSCRFVIFEATGSDMISTVEAALIKIYKPLWNTWDGF 

GNHTPGAGRFAQAKSDWDVIHPGREWAEKCTGVHSEPYFIEERIKQYFSKSNFT" 

1034..2182

/gene="eco29kIM" 

1034..2182

/gene="eco29kIM"

/function="DNA modification"

/codon_start=1

/transl_table=11

/product="DNA-methyltransferase"
/protein_id="CAA04944.1"

/db_xref="GI:2660513"
/db_xref="SPTREMBL:Q59380"

/translation="MRSLEIFSGAGGLAKGLELAGFQHVGFVELNKHACDSLRLNFDE 
EKVFQGDIKNYDLSSLDKIDIVAGGPPCQPFSLGGKHKAHDDSRDMFPFAIKAIEVLQ 
PKAFIFENVKGLLRKSFADYFEYIILRLTYPCLSLSNVSDWRIHLDVLRKARKDKHCG 
CTYDVSFKLINAADYGVPQIRERVIIVGVRSDLNKKWSFPSVTHTQEKLLWEQHVTGD
YWRKHNIDAVIDKGISERLMKRFGIFPPSGKSWVTVRDALKNVPQPYESHGIGDHIFK
DGARVYPGHTGSYIDLPSKTLKAGAHGVPGGENMIRYEDGTVRYFTVYEGKLLQTFPR
EFLIAGAWGEAMRQIGNAVPVLLAQKIGQQLYNLLNEEVEPLLCSNFN"
Query Match 84.0%; Score 368; DB 1; Length 3895;

Best Local Similarity 94.5%; Pred. No. 2.3e-97; 

Matches 414; Conservative 0; Mismatches 20; Indels 4; Gaps 3;

aaaaacgaagaccccagaaaaggccgcgccggaggcgctttttcGataggctccgcGCGC 63 

I I I I I III III I I I I I I I II II I I I I II I M I I I I I I I I I I I I I I I II M I 
AAAAAACGAGAACCCGGAAAAGGCCACGCCGCTGGCG-TTTTTCCATAGGCTCCGCGCCC 2810

ctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggact-t 122 
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||| |
CTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTAT 2750 

aaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgc 182
M M M M M M M M I I I I I I I I M M I I I I I I I M I I I I I I I I I I I I I I M M M I I 

AAAGATACCAGGGGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGC 2690 

cgcttaccggatacctctccgcctttctcccttcgggaagcgtggcgctttctcatagct 242 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CGCTTACCGGATACCTCTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCT 2630 

cacgctgttggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacg 302 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
CACGCTGTTGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACG 2570

aaccccccgttcagcccgaccactgcgccttatccggtaactatcgtcttgagtccaacc 362 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I 
AACCCGCCGTTCAGCGCGAGCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACC 2510 

cggtaagacacgactttacgccactggcagcagccattggtaactg--aaaagtggattt 420

I II II I II I I I I III I I I I I I I I I I I I M I I I I I I M M I I I I I II I I I M M 
CGGTAAGACACGCCTTAACGCCACTGGCAGCAGCCATTGGTAACTGGACAAGGTGGATTG 2450 